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Abstract 

Pellet's theorem determines when the zeros of a polynomial can be separated into 
two regions, based on the presence or absence of positive roots of an auxiliary poly- 
nomial, but does not provide a method to verify its conditions or to compute the 
roots of the auxiliary polynomial when they exist. We derive an explicit condition 
for these roots to exist and, when they do, propose efhcient ways to compute them. 
A similar auxiliary polynomial appears for the generalized Pellet theorem for matrix 
polynomials and it can be treated in the same way. 
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1 Introduction 

Pellet's theorem, a classical result from 1881, derives conditions under which the zeros of 
a polynomial can be divided into two groups, according to their magnitudes. It is a direct 
consequence of Rouche's theorem and is stated as follows. 

Theorem 1.1 (JB^, g| Th.(28,l), p. 128]) Given the polynomial p{z) = + an-iz'^~'^ + 
• • • + oiz + oo with complex coefficients, n>3, l<fc<n — 1, and agak 7^ 0. Let the 
polynomial 

(p{x) = x" + |an-i|x"~^ H h \ak+i\x''+'^ - \ak\x'' + |ofc_i|x''~^ H h jao| 

have two distinct positive roots r and R, r < R. Then p has exactly k zeros in or on the 
circle \z\ = r and no zeros in the annular ring r < \z\ < R. 

We note that, by Descartes' rule, 4> has either two or no positive roots. Although the 
function <f) depends on k, this parameter remains fixed throughout the paper, and it will 
therefore be omitted from the notation to prevent unnecessary clutter. 

Recently, ([2], [5]) a generalized Pellet theorem was derived for matrix polynomials, 
which have received a lot of attention recently because of their application in several 
engineering fields ([7]). Matrix polynomials are polynomials whose coefficients are matrices 
instead of scalars. They occur in polynomial eigenvalue problems, which consist of finding 
a nonzero eigenvector v, corresponding to an eigenvalue z satisfying P{z)v = 0, where 

P{Z) = AnZ"" + An-iz""-^ + • • • + ^0, 
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with Aj G (27*"^"^ for j = 0, . . . n, and with det {P{z)) not identically zero. If An is singular 
then P has infinite eigenvalues and if Aq is singular then zero is an eigenvalue. There are 
nm eigenvalues, including possibly infinite ones. The finite eigenvalues are the solutions of 
det {P{z)) = 0. The following theorem generalizes Pellet's theorem to matrix polynomials. 

Theorem 1.2 (Generalized Pellet theorem.) (f^, fS^) Let 

P{z) = + A„_iz"-i + ... + Aiz + Aq 

he a matrix polynomial with n > 2, Aj S aj'^^ni j = q, . . . , n, and Aq ^ 0. Let Ak be 
invertible for some k with 1 < k < n — 1, and let the polynomial 

g{x) = • ■+\\Ak+i\\x^+'^-\\Al^\\'~'^x^+\\Ak-.i\\x^-^+- ■ •+Pi||x+Po 

where the norm can he any vector-induced norm, have two distinct positive roots r and R, 
r < R. Then P has exactly km zeros in or on the disk \z\ = r and no zeros in the annular 
ring r < jz| < R. 

Its generalization widens the usefulness of Pellet's theorem considerably, as the compu- 
tation of nonlinear eigenvalues is much more costly than the computation of polynomial 
zeros, making easily computed bounds more valuable. 

However, both the classical and generalized Pellet theorems do not explain how their 
assumptions can be verified, nor do they provide a method to compute the roots r and 
R when the assumptions are satisfied. In addition, as will be explained below, we require 
certain properties of any iterates generated in such computations. Our sole purpose is to 
address these shortcomings, thereby improving the applicability of these theorems. We 
stress that we focus on the implementation of Pellet's theorem, if and when it is used. 
When and where it is worthwile to use the theorem is a consideration that lies well outside 
the scope of this work. 

Because the real polynomials (f) and g are of the exact same form, the techniques we 
will develop apply to both Theorem 11.11 and Theorem ll.2[ To keep matters simple, we will 
henceforth refer only to Theorem II. H with the understanding that all results carry over 
in a straightforward way to Theorem 11.21 

Figure [T] shows a few typical scenarios for the function (/) in Theorem 11.11 on the left 
(j) increases at first, then decreases , intersects the x-axis and then increases while crossing 
the X-axis again; in the middle, (f) behaves as on the left, but does not decrease enough 
to have positive roots; on the right (p increases monotonically. The presence or lack of 
positive roots needs to be detected first, and, if there are such roots, then they need to 
be computed with a method that generates iterates that are themselves proper bounds, 
which means that r and R need to be approached from inside the interval [r, R] . In this 
way, the numerical process can be stopped at any time with an upper bound on r and a 
lower bound on R, allowing for an inexact solution while still providing correct bounds on 
the two groups of k and n — k zeros of p. We consider this a key property to be satisfied 
and, although there exist many methods to compute the real roots of a polynomial, none 
of them accomplishes this. 
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We derive an easily computable criterion for the absence or presence of positive roots of 
(j) in Theorem II .11 which has the important additional advantage of providing an adequate 
starting point in [r, i2] for the computation of the positive roots of (p. This allows us 
to reformulate Pellet's theorem in a more useful way and to propose a framework for 
generating efficient methods to compute the roots, when they exist, from inside [r, R\ for 
a particular given value of k. The complexity of detecting whether has positive roots or 
not is, as we will see later, 0{n). This means that, if the theorem needs to be applied for 
every value of k, with 1 < k < n — 1, then this complexity becomes O(n^), which makes 
it more efficient to first use a result by [T] . In [T] , it was shown that if (f) has two real roots 
for a particular value of fc, then k must be the abscissa of a vertex of the Newton polygon 
associated with p. Moreover, computing these vertices only costs 0{n\ogn) operations 
([3]). In such a case, we would first compute the abscissae /ci, ...km of the vertices of the 
associated Newton polygon and then compute the roots r and R only for the values of 
k e {ki, ...,km}. Once the abscissae have been computed, the complexity of detecting 
the positive roots is then 0{mn). Typically, m <^ n since only polynomials with very 
special coefficients have more than just a few values of k for which Pellet's theorem can 
be applied. 

From now on, we compute the roots of (p for a given value of k which remains fixed 
throughout this work. Detection of the roots and the determination of a starting point is 
the subject of Section O while a strategy to compute the roots is developed in Section [3l 
We believe that the ideas behind our techniques are general enough to be useful in other 
situations as well. 




Figure 1: Typical scenarios for (p. 

2 Detection of the roots 

Our strategy to determine if (p has positive roots is to first transform it to a strictly convex 
function x with the same roots or absence of roots as (p. Denoting x's unique minimizer 
by X*, we conclude that (p has two positive roots if and only if (p(x*) < 0. We note that 
this minimizer does not, in general, correspond to the minimizer of (p (if it exists). This 
leads to the following theorem. 
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Theorem 2.1 Given the polynomial 

(j)(x) = x" + 7?„--ix""^ + • • • + m+ix'''^^ - Vkx'' + ??fc-ix^~^ + --- + V0 , 

with rjj > (j = 1, ...,nj, r/or/fc 7^ 0, n > 3, and 1 < fc < n — 1. Then cp has two positive 
roots if and only if (p{x*) < 0, where x* is the unique positive root of the polynomial 

X{x) = {n-k)x'' + {n-k-l)r]n-ix''-^ + ■ ■ ■ + r]k+ix''+^ -r]k-ix''~^ -2r]k-2x''~^ • 

Proof. We define ■iIj{x) for x G (0,+oo) as 

iIj(x) = x-^(t){x) = + + • • • + r/fc+ix -rik+ Vk-ix~^ + ■■■+ rjox'^ , 

and observe tliat it is a strictly convex function with the same positive roots or lack thereof 
as (j). This function will therefore have a unique positive minimizer, which we denote by 
x* , and it will have two positive roots if and only if ^(x*) < 0, which is equivalent to 
(p{x*) < 0. Because x* is the unique minimizer of a strictly convex differentiable function, 
it can be obtained as the unique positive solution of ■ip'{x) = 0. Since 

^'(x) = (n - /c)x"~''~^ + {n-k- l)??„_ix"~^-2 + • • • + m+i - m^i^""^ + krjoX~''~^ , 

and since xp'{x) = <^=^ x''~^^ip'{x) = 0, the statement of the theorem follows. O 
Defining 

a{x) = + r/n-ix"-^ + • • • + r/fc+ix^+^ + Vk-i^'''^ + ••• + %, 

we remark that Theorem 12 . 1 1 can also be obtained from the observation that the existence 
of n > for which 0(n) < is equivalent to the existence of u > for which \aj^\ > a{u)/u^ . 
The function in the right-hand side of this inequality is strictly convex and has a unique 
minimizer u* . Then a{u*) / {u*)^ represents the lowest strict threshhold value for |afc| to 
guarantee the existence of some u > for which (j){u) < 0. 

Since x* is independent of the value of r/^, it is possible to establish a criterion for 
r/fc that guarantees positive roots for (f), and that depends only on the other coefficients. 
Incorporating this in Pellet's theorem results in the following more explicit version of that 
same theorem. 

Theorem 2.2 Given the polynomial p{z) = z"' + an-\z^~^ + ■ ■ ■ + aiz + oq with complex 
coefficients, a^ak 7^ 0, and n > 3. Let 1 < k < n — 1, 

(j)[x) = x" + |a„_i|x"~^ H h |afc+i|x^+^ - |aA..|x^' + |afe„i|x''~^ H h |ao| , 

cr(x) = x" + |a„_i|x""^ H h |afc+i|x''+^ + lafc.ijx''"^ H h |ao| , 

and let x* be the unique positive root of the polynomial 

X(x) = (n-A;)x" + (n-A:-l)|a„_i!x"~^H h|afc+i|x''+^-|afc-i|x''"^-2|afc_2|x'="^ k\ao 

If 



then the polynomial (f) has two distinct positive roots r and R with r < R, and the poly- 
nomial p has exactly k zeros in or on the circle \z\ = r and no zeros in the annular ring 
r < |z| < R. 

The polynomial x appearing in the two previous theorems, is strictly convex to the 
right of its positive root, so that, to compute it, Newton's method or a suitably accelerated 
version of it can be used with guaranteed monotonic convergence from the right of the 
root. As a starting point, any easily computable upper bound on the roots of (p can be 
used. Such a bound can be found with the following theorem. 

Theorem 2.3 Let 

= + r/„_ix"~^ H h rjmx'^ - r]iX^ - % , 

with r]j > (j = 1, ■■■,n), m > £, rjQrjirjrnrjn 7^ 0, and n > 3. Then the unique positive root 
X* of uj satisfies the following. 
(1) Ifu}{l) < 0, then 

l^^* ^ ( Ve + V£^i + --- + Vo \ ^ . 
~ \r]n + Vn-i H l-rjmj 



(2) Ifuj{l) > 0, then 

\Vn + Vn-l H l-VniJ 

Proof. By Descartes' rule of signs, the polynomial ui has a single positive root x* and is 
therefore negative for < x < x* . If uj{l) = 0, then x* = 1. If uj{l) < 0, then x* > 1, and 
for X > 1 we have that 

^(x) > (??„ + r]n-i H h r]m) X™ - {r]i + iji^i H \- r]o) x^ , 

with the equality holding for x = 1. The unique positive root of the right-hand side then 
provides an upper bound on the positive root of u. 

If a;(l) > 0, then x* < 1, and for 3; < 1 we have that 

uj{x) > (?/„ + r]n-i H h 2;" - (r/£ + H h , 

with equality for x = 1. The unique positive root of the right-hand side once again provides 
an upper bound on the positive root of uj. That it is less than 1 follows from uj{l) > 0. 
This concludes the proof. O 

Applying this theorem to the aforementioned polynomial x directly yields the following 
corollary. 
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Corollary 2.1 Let 



X{x) = {n-k)x^ + {n-k-l)rfn^ix'^ ^ + - • • + ^-2rjk-2x'' ^ /cr?o , 

with r]j > (j = 1, ■■■,n), % 7^ 0, n > 3, and 1 < k < n — 1. Then the unique positive 
root X* of X satisfies the following. 
(1) //x(l) <0, then 

1 /2 

^ < X < 7 TT ; — • (2) 

\[n - k) + [n - k - Ijrjn-i H h m+i J 



(2) //x(l) >0, t/ien 

Q ^ ^* ^ / + 2r?fc_2 + ■ ■ ■ + /c?7o 

~ V (n - A;) + (n - A; - 1)t/„_i H h 7?fc+i 

The bounds can be adjusted like in Theorem 12.31 if ?7fc+i, f?fc-i, or more coefficients vanish. 
There are several possible strategies for finding a proper starting point to begin the com- 
putation of the roots of (p if they exist: one can periodically compute sgn{(j)) at an iterate, 
and use that iterate as a starting point if sgn{4>) = —1, or one can simply first compute 
X* and and only then start the computation of the roots of (ji, using x* as a starting point 
(assuming, as we did, that (j){x*) < 0). The complexity of this detection phase is 0{n). 

We remark here that there exist other methods to detect if a polynomial has real roots, 
such as, e.g., Sturm sequences. These methods have similar complexity, but they generally 
do not produce an appropriate starting point in [r,R], which is essential. 




3 Computation of the roots 

In this section we assume that (j) has two positive roots r and R and that we are given a 
point X such that r < x < R, which can be obtained in the way explained at the end of 
the previous section. We then propose a method to compute the roots iteratively, with the 
iterates converging monotonically to the roots from inside the interval [r, R] . The main 
idea behind this method is to approximate (p at a given point by a similar but simpler 
function that dominates (j) on [r, R]. Its roots will be approximations to the roots of (p, and 
the method then continues iteratively from those approximations. The following theorem 
derives the approximation to <j). 

Theorem 3.1 Let the polynomial 

(j)(x) = x"' + vn-ix"-'^ + ■■■ + m+ix''^^ - Vkx^ + m-ix'"'^ + --- + V0 , 

with r]j > (j = 1, ■■■,n), 7^ 0, n > 3, and 1 < k < n — 1, have two positive roots r 
and R with r < R and let 

(x) = x" + 7?n-ia;""^ + • • • + m+ix^^^ , (4) 
(p2{x) = -rikx^ + 7]k-ix^~^ H Vriix + 7]Q . (5) 
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Then for r < x < R, the trinomial 

/(x) = ax"-/3x'= + 7, (6) 

where 

a = (x)>0, 
n 

/3 = -ixi-V2(S)>0, 

7 = - X {^(p'i{x) + ^02(^)) > , 

has two positive zeros ri and r2 with r < ri < r2 < R, and f{x) > (j){x) for x > 0. 

Proof. Our goal is to approximate (p at a. certain point r < x < R, for which (j){x) < 0, 
by a function / that agrees at this point with (j) in (at least) function and first derivative 
values. In addition, / needs to dominate (f) for x > 0. Clearly, a straightforward linear 
approximation is not possible as (j) is composed of terms that are both convex and concave. 
We solve this problem by constructing separate approximations for and (j)2, whose sum 
is (p, and which were defined in the statement of the theorem. 
The transformation of variables w = x^ transforms (j)i{x) into 

(pi[w ' )=w + rin~iw " H Vrjk+iw " . 

This is a concave function of so that it is dominated by its linear approximation aiw+a2 
at a point w = x^. A straightforward calculation shows that 

a, = (-w"^) <p[{w'/-) = -x'--^[{x) , 

«2 = <Pl(w^^"') — CtlW = 4>l(x) X((>i(x) , 

n 

where the derivatives are with respect to x. Clearly, ai > and 

ot2 = M^) - -x(j)[{x) = - (r/„„ix"^i + 2r/„„2S""^ + ■ ■ ■ + {n - k - l)r/fc+ix'=+^) > . 

n n \ J 

We have obtained an approximation to (/>i(x) of the form aix"+a2, with aix"+a2 > <i>\{x) 
for x > 0. 

On the other hand, the transformation y = x^ transforms 4>2{x) into 

My^^'') = -VkV + Vk-iy^ + ■■■ + mv^'^ + r/o , 

a concave function of y. It is therefore dominated by its linear approximation Piy + P2 at 
a point y = x^ , with 

/3i = (^y"^) <t^'2{t'') = ^s'-V^(x) , 
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Since 



Mx) - -^X(j)2ix) = - [pt^ix^ ^ + 2r]k-2X^ ^ + • • • + (A; - l)r/ix + r/oj , 

1 k 
we have that /32 = 4>2{x) — —xcjy'nix) > 0, from which —(j)2(x) > 4>2{x). Because 4)ix) = 

k _ _ X 

(j)i{x) + (j)2ix) < 0, and therefore (j)2ix) < — < 0, this means that 02 (^) < 0' that 

/3i < 0, and we have obtained an approximation to (f)2{x) of the form f3ix^ + /32, with 

/3i2;*^ + /32 ^ 02 (ic) for x > 0. Consequently, our first-order approximation / to at x is 

given by the trinomial 

f{x) = aix" + i3ix'' + a2 + /32 , 

which corresponds to ([6]) in the statement of the theorem. It satisfies f{x) > 4>{x), is of 
the same form as 0, and, since f{x) < 0, it has two roots ri and r2 with r < ri < r2 < R- 

D 

An algorithm to compute r and R can now be based on Theorem 13.11 in a standard 
way: starting from any initial point on the interval {r,R), /'s largest root can be used as 
the next iterate in the computation of R. Convergence is monotonic from the left. An 
analogous algorithm is obtained for r by considering the smallest root of /. In this case, 
convergence is monotonic from the right. The function (p' has either one root, which must 
lie in (r, R), or two roots, one in (0, r) and one in (r, R), so that cp' {r)(j)' (R) ^ 0. It is then 
a technical exercise to show that the order of convergence of these algorithms is quadratic. 

However, to make the aforementioned algorithms implementable, we need a method 
to compute the roots of / itself from inside the interval determined by its roots. The 
basis for such a method is provided by the following theorem, which derives a first order 
approximation to / that dominates it, and whose roots can be computed explicitly. 

Theorem 3.2 Let the trinomial f{x) = ax"' — j5x^ + 7 with a,/3,7 > 0, n > 3, and 
1 < k < n — 1, have two positive roots ri and r2 with ri < r2- Then for ri < x < r2, the 
function 

/i(x) = ^^-/3x'= + 7, (7) 
e — x^ 

where 

§=- > , 
n 

n + k _r. 

e = x'^ > x'^ , 

n 



has two positive zeros si and S2 with ri < si < S2 < r2, and h{x) > f{x) for < x < 
(1 + n/k)x. 

Proof. With the transformation of variables, y = x^, f can be written as f{y^^^) = 
ay"^/^ — Py + j. If the function y"/*'' is approximated at y = x'' to first order by 6/ {e — y), 
then this is equivalent to approximating y""/'^ to first order by {—l/6)y-\-e/5. Since y~"/^ 
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is convex and its approximation is linear, this implies that {—l/5)y + e/5 < y "Z'^, and 
therefore that 5/{e — y) > y"/'^, or 

A 



k 



Consequently, h{x) > f{x) for x > and, because h{x) = f{x) < 0, it has two positive 
roots si and S2 that satisfy ri < si < S2 < r2. The constants 5 and e are computed from 
the first order approximation conditions 



as 



5 = -yi+"/'= = -x'=+", 
n n 

n + k _ n + k _u 

e = y = X . 

n n 

We note that h{x) becomes unbounded as x — > (1 + n/k)x. This concludes the proof. O 
Figure [2] shows the functions (j) < f < h for the polynomial 

q{z) = z^ + z'^ + + ^z^ + I5z^ - 2z^ + {i + l)z-A, (8) 

with k = 3 and x = 1.02. 

The approximation /i of / in Theorem 13.21 leads to an iterative method for the com- 
putation of the roots of /, by computing the roots of h and then using the smallest and 
largest of those roots as the next iterates for the computation of ri and r2, respectively. 
As was the case for the roots of (j), we obtain quadratic and monotonic convergence to the 
roots of /, although now the roots of the approximation can be computed explicitly since 
they are the roots of a quadratic. This follows by setting h{x) = 0, which is equivalent to 

Px^'' - (e/3 + 7)x'= + {a6 + £7) = , 

a quadratic equation in x''. That both roots of this quadratic are real follows directly from 
the properties of h. Once available, the appropriate root of / serves as the next iterate in 
the computation of the roots of (p. 

The main computational effort in our approach to compute the positive roots of (j) in 
Theorem 11.11 for a fixed parameter k is concentrated in the computation of the coefficients 
of the trinomial /, which requires 0{n) arithmetic operations. The computation of the 
roots of / itself is far less costly. Furthermore, the iterative process can be stopped at any 
moment, since each iterate provides a correct bound on the corresponding root of (p. This 
is precisely what we set out to obtain. 

Example. 

The motivation for this work was the absence of a method converging from inside the 
interval [r, ii], which means that there is no equivalent method to compare our methods 
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to. Instead we will illustrate the method outlined above at the hand of an example, 
namely, the polynomial q defined in ([8]). For this polynomial, we computed the roots of 
the corresponding real polynomial (p for /c = 3 to a relative accuracy of 10" ^2 _ The steps 
involved in this process with the corresponding number of iterations are listed below. All 
computations are carried out to the same aforementioned relative accuracy of 10"^^. 

Step 1: Compute a starting point with Corollary 12.11 for the computation of x*, the positive 
root of X- 

Step 2: Compute x* (4 Newton steps). 

Step 3: Starting from x*, compute the roots of (j) with the help of the trinomials /, (6 iterations 
with the method based on Theorem 13.11 for each root). 

The number of iterations, necessary to compute the largest root of / with the method 
from Theorem 13. 2| was 9, 7, 5, 3, 2, and 1, corresponding to the six times such a root 
needed to be computed. For the smallest root, we obtained 6, 5, 4, 2, 1, and 1 iterations. 
We observed that this decrease in the number of iterations required for the roots of / is 
typical, regardless of the degree of the polynomial. 

The number of iterations in the computation of the roots of (j) does not vary sig- 
nificantly with increasing degree, although the number of Newton steps necessary for the 
computation of x* tends to increase with increasing degree and may require an accelerated 
Newton method. On the other hand, it is usually not necessary to accurately compute x* . 




Figure 2: The functions (p < f < h for q{z), k = 3, and x = 1.02. 
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